Stance Detection Benchmark: How Robust is Your Stance Detection?

نویسندگان

چکیده

Abstract Stance detection (StD) aims to detect an author’s stance towards a certain topic and has become key component in applications like fake news detection, claim validation, or argument search. However, while is easily detected by humans, machine learning (ML) models are clearly falling short of this task. Given the major differences dataset sizes framing StD (e.g. number classes inputs), ML trained on single usually generalize poorly other domains. Hence, we introduce benchmark that allows compare against wide variety heterogeneous datasets evaluate them for generalizability robustness. Moreover, framework designed easy integration new probing methods Amongst several baseline models, define model learns from all ten various domains multi-dataset (MDL) setting present state-of-the-art results five datasets. Yet, still perform well below human capabilities even simple perturbations original test samples (adversarial attacks) severely hurt performance MDL models. Deeper investigation suggests overfitting biases as main reason decreased Our analysis emphasizes need focus robustness de-biasing strategies multi-task approaches. To foster research important topic, release splits, code, fine-tuned weights.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

360{\deg} Stance Detection

The proliferation of fake news and filter bubbles makes it increasingly difficult to form an unbiased, balanced opinion towards a topic. To ameliorate this, we propose 360° Stance Detection, a tool that aggregates news with multiple perspectives on a topic. It presents them on a spectrum ranging from support to opposition, enabling the user to base their opinion on multiple pieces of diverse ev...

متن کامل

Stance detection in online discussions

This paper describes our system created to detect stance in online discussions. The goal is to identify whether the author of a comment is in favor of the given target or against. Our approach is based on a maximum entropy classifier, which uses surface-level, sentiment and domain-specific features. The system was originally developed to detect stance in English tweets. We adapted it to process...

متن کامل

Stance Detection in Turkish Tweets

Stance detection is a classi€cation problem in natural language processing where for a text and target pair, a class result from the set {Favor, Against, Neither} is expected. It is similar to the sentiment analysis problem but instead of the sentiment of the text author, the stance expressed for a particular target is investigated in stance detection. In this paper, we present a stance detecti...

متن کامل

Exploring Various Linguistic Features for Stance Detection

In this paper, we describe our participation in the fourth shared task (NLPCC-ICCPOL 2016 Shared Task 4) on the stance detection in Chinese Micro-blogs (subtask A). Different from ordinary features, we explore four linguistic features including lexical features, morphology features, semantic features and syntax features in Chinese micro-blogs in stance classifier, and get a good performance, wh...

متن کامل

A Dataset for Multi-Target Stance Detection

Current models for stance classification often treat each target independently, but in many applications, there exist natural dependencies among targets, e.g., stance towards two or more politicians in an election or towards several brands of the same product. In this paper, we focus on the problem of multi-target stance detection. We present a new dataset that we built for this task. Furthermo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Ki - Künstliche Intelligenz

سال: 2021

ISSN: ['1610-1987', '0933-1875']

DOI: https://doi.org/10.1007/s13218-021-00714-w